README.txt  
Replication Materials for: Re-Innovation Nation: The Political Economy of Technology Transfer Policy in Post-WTO China

Author: John David Minnich
Date: May 10, 2025
Journal: Journal of Politics

---------------------------------------------------
DESCRIPTION
---------------------------------------------------

This archive contains all code, data, and output files necessary to reproduce the results presented in the article, including all tables and figures in the main text and appendix.

The structure of the folders is as follows:

1. Code/
   - Contains all scripts used to clean/process the data, conduct the analysis, and generate the figures and tables used in the main paper.
   - Scripts are named and ordered to reflect the workflow from raw data to final output.

2. Data
   - Raw/: Contains the original, unprocessed data as obtained from sources (excludes proprietary data).
   - Processed/: Contains data files produced by scripts in the Code/ folder and used for analysis and visualization in the main paper.

3. Output/
   - Figures/: All figures that appear in the main paper.
   - Tables/: All tables that appear in the main paper.

4. Appendix/
   - Code/: Scripts used for appendix analyses.
   - Data/: Any additional data used for appendix-specific analyses.
   - Output/: Figures and tables presented in the appendix.

5. Logs/
   - A log file (replication_log.txt) encoding steps taken to load and process raw Chinese Customs Data and key outputs from table_1 and figure_4 in the main paper.  

---------------------------------------------------
SOFTWARE REQUIREMENTS
---------------------------------------------------

- Language: All analyses were conducted using R version 4.4.3 and RStudio version 2024.12.1+563.
- Running under: macOS Sequoia 15.3.2
- Packages/Libraries: See comments at the top of each script for a list of required packages.

---------------------------------------------------
CODE FILES
---------------------------------------------------

The replication archive contains the following R scripts: 

**Main Paper**

- 01_create_process_share.R: This script reads in raw Chinese import data from Chinese Customs Data (see Data section below) and creates a simplified measure of the median share of imports tied to export-processing trade by industry for use in 04_create_final_dataset.R

- 02_create_hhi.R: This script reads in raw data on industrial output by region in China and creates a Herfindahl–Hirschman Index (HHI) of the geographic concentration of industrial output for use in 04_create_final_dataset.R

- 03_create_soe_share.R: This script reads in raw data on industrial output by firm type in China and creates a measure of state-owned enterprises' (SOEs) share of industrial output for use in 04_create_final_dataset.R

- 04_create_final_dataset.R: This script reads in processed outputs from 01_create_process_share.R, 02_create_hhi.R, 03_create_soe_share.R, and original datasets on technology transfer policies and strategic industries and creates the final dataset used in subsequent analyses. 

- 05_create_total_isic.R: This script reads in the final_dataset.csv file and data on technology transfer policies and creates a measure of the total number of tech transfer policies in place by industry for use in subsequent analyses.

- 06_figure_1.R: This script reads in data on technology transfer policies and creates a bar plot of the number of tech transfer policies in place by year for use in creating figure_1.pdf

- 07_figure_2.R: This script reads in data on technology transfer policies and creates a stacked bar plot of the number of tech transfer policies in place by type of policy and year (figure_2.pdf)

- 08_figure_3.R: This script reads in data on technology transfer policies, strategic industries, and export-processing trade dependence to create figure_3.pdf

- 09_table_1.R: This script reads in final_dataset.csv and total_isic.csv to run the analysis for and produce table_1.tex

- 10_figure_4.R: This script reads in final_dataset.csv to create figure_4.pdf

- 11_figure_5.R: This script reads in final_dataset.csv to create figure_5.pdf

**Appendices**

- B1_create_table_1.R: This script reads in the full list of strategic industries to create B_table_1.tex

- B2_create_alt_strategic.R: This script reads in data on an alternative measure of strategic industry based on Chinese government policy documents to create final_dataset_alt_strategic.csv for use in B3_create_figure_3.R

- B3_create_figure_3.R: This script reads in final_dataset_alt_strategic.csv to create B_figure_3.pdf

- B4_create_table_2.R: This script reads in a list of industries targeted by Chinese central state policies to create B_table_2.tex

- B5_create_table_3.R: This script reads in a list of isic industries corresponding to industries targeted by Chinese central state policies to create B_table_3.tex 

- C1_create_rnd_only.R: This script reproduces final_dataset.csv including only R&D-intensive industries as strategic to create final_dataset_rnd_only.csv

- C2_create_figure_4: This script reads in final_dataset_rnd_only.csv to create C_figure_4.pdf

- C3_create_gp0911_only.R: This script reproduces final_dataset.csv including only government procurement policies from 2009-2011 to create final_dataset_gp_0911.csv

- C4_create_figure_5: This script reads in final_dataset_gp_0911.csv to create C_figure_5.pdf

- C5_create_figure_6: This script reads in data on ownership restrictions and processing trade dependence to create time trends plots in C_figure_6.pdf

- C6_create_figure_7: This script reads in data on local content requirements and processing trade dependence to create time trends plots in C_figure_7.pdf

- C7_create_figure_8: This script reads in data on government and processing trade dependence to create time trends plots in C_figure_8.pdf

- C8_create_or_only.R: This script reproduces final_dataset.csv including only ownership restrictions to create final_dataset_or_only.csv

- C9_create_figure_9.pdf: This script reads in final_dataset_or_only.csv to create C_figure_9.pdf

- C10_create_lcr_only.R: This script reproduces final_dataset.csv including only local content requirements to create final_dataset_lcr_only.csv

- C11_create_figure_10.R: This script reads in final_dataset_lcr_only.csv to create C_figure_10.pdf

- C12__create_gp_only.R: This script reproduces final_dataset.csv including only government procurement to create final_dataset_gp_only.csv

- C13_create_figure_11.R: This script reads in final_dataset_gp_only.csv to create C_figure_11.pdf

- C14_create_figures_12_13.R: This script reads in data on NDRC and State Council policies and central state revenue to create C_figure_12.pdf and C_figure_13.pdf

- C15_create_figure_14.R: This script reads in final_dataset.csv to create C_figure_14.pdf

- C16_create_figures_15_16.R: This script reads in final_dataset.csv to create C_figure_15.pdf and C_figure_16.pdf

- C17_create_figure_17.R: This script reads in final_dataset.csv to create C_figure_17.pdf

- D1_create_table_4.R: This script reads in final_dataset.csv to create D_table_4_wrapped.tex

- E1_create_marketsize.R: This script reads in data on Chinese and world imports, processing trade dependence, and final_dataset.csv to create final_dataset_marketsize.csv

- E2_create_table_5.R: This script reads in final_dataset_marketsize.csv to create E_table_5_wrapped.tex

- E3_create_table_6.R: This script reads in final_dataset_marketsize.csv to create E_table_6_wrapped.tex

---------------------------------------------------
DATA FILES
---------------------------------------------------

The replication archive contains the following data files, arranged in order of their appearance in the preceding R Scripts: 

**Main Paper**

- isic_hs4_correspondence.csv: A correspondence table for the International Standard Industrial Classification (ISIC) system revision 4 and Harmonized System (HS)

- provincial_output.csv: Raw data on industrial output (among other indicators) by province/region in China 

- nbs_soe.xlsx: Raw data on SOE industrial output (among other indicators) from China's NBS

- nbs_total.xlsx: Raw data on industrial output (among other indicators) from China's NBS

- cic_isic_correspondence_table.csv: A correspondence table for ISIC Rev. 4 and Chinese Industrial Classification (CIC) system

- isic_list: A list of all 420 ISIC 4-digit industry codes

- strategic_isic.xlsx: An excel workbook with 3 sheets corresponding to each category of ISIC 4-digit industries (manually coded) used to create measure measure of strategic industry

- dv_clean.xlsx: An excel workbook with 3 sheets corresponding to each type of technology transfer policy used to create measure of tech absorption policies and their corresponding ISIC 4-digit industry codes (manually coded)

- hhi_final_csv: A processed measure of geographic concentration of industrial output, from provincial_output.csv

- soe_share.csv: A processed measure of SOE's share of industrial output, from nbs_soe.xlsx and nbs_total.xlsx

- totaL_isic.csv: A dataset based on a processed measure of the total # of tech absorption policies by industry based on dv_clean.xlsx

**Appendices** 

- strategic_industry_list.csv: A full list of ISIC 4-digit industries coded as strategic

- cn_policies_strategic.xlsx: An excel workbook containing data on industries targeted in 3 Chinese national industrial policies and their corresponding ISIC 4-digit industry codes (manually coded)

- final_dataset_alt_strategic.csv: A version of the final dataset including the alternative measure of strategic industry based on cn_policies_strategic.xlsx

- final_dataset_rnd_only.csv: A version of the final dataset including only R&D-intensive industries as strategic

- final_dataset_gp_0911.csv: A version of the final dataset including only government procurement policies from 2009-2011

- final_dataset_or_only.csv: A version of the final dataset including only FDI ownership restrictions as the outcome

- final_dataset_lcr_only.csv: A version of the final dataset including only local content requirements as the outcome

- final_dataset_gp_only.csv: A version of the final dataset including only government procurement as the outcome

- cn_govt_policies_count.xlsx: An excel workbook containing data on the number of policies issued by the NDRC and State Council and central state revenues by year

- cn_imports.xlsx: A dataset with Chinese imports by HS4-digit product category and year

- world_imports.xlsx: A dataset with world imports by HS4-digit product category and year

- final_dataset_marketsize.csv: A version of the final dataset including measures of China's share of world imports

---------------------------------------------------
DATA SOURCES
---------------------------------------------------

- The measures of industry-level variation in China's export-processing trade dependence used in the article ('median_share' and 'processing_share' created with '01_create_process_share.R' from ccd_isic_processing.csv) draw on data from the RESSET China Customs Import and Export Full Port Database, more commonly known as the Chinese Customs Data database. The data is proprietary so the raw data files used to create the measure are not included in the replication archive. The data can be accessed via many Chinese university library systems or purchased separately. For more information on how to access this dataset outside of Chinese university library systems, see: https://www.resset.cn/index/home/

- The measures of geographic concentration and SOE share of industrial output used in the article draw on data from China's National Bureau of Statistics accessed through through the All China Marketing Research China Data Online portal (via the London School of Economics library system). The data itself is publicly available and is included in full in the 'Data/Raw/' folder (China Data Online simply provides a user-friendly interface). For more information on how to access this data, see: https://www.china-data-online.com/acmr-cndata-pub/home/homes.htm?auto=1

- The measure of R&D-intensive industries used to create the measure of strategic industries used in the article draws on data from The Global Innovation 1,000 Study by the consultancy Strategy&. For more information on how to access this data, see https://www.strategyand.pwc.com/gx/en/insights/innovation1000.html

- The measures of the number of policies issued by China's State Council and National Development and Reform Commission used to create Figure 12 in Appendix C draw on data from the PKU Laws and Regulations Database. The PKU LRD is a proprietary database that can be accessed via a number of Chinese and overseas university library systems. For more information on how to purchase access to the LRD separately, see: https://www.pkulaw.com/

- The measure of Chinese central government revenues by year used to create Figure 13 in Appendix C draws on data from China's National Bureau of Statistics, accessible here: https://data.stats.gov.cn/english/tablequery.htm?code=AC07

- Data Chinese and world imports used to construct measures of China's market share for the analysis in Appendix E were constructed using UNCTAD trade data accessed via the ITC's trademap tool. The data is accessible here: https://www.trademap.org/Index.aspx

---------------------------------------------------
NOTES
---------------------------------------------------

- Scripts are commented for clarity and transparency.
- If you encounter issues reproducing the results, please contact j.minnich@lse.ac.uk.

---------------------------------------------------
CITATION
---------------------------------------------------

If you use these materials, please cite the article as:

[Full citation of the article]

